Dataset statistics
| Number of variables | 28 |
|---|---|
| Number of observations | 117405 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 1096 |
| Duplicate rows (%) | 0.9% |
| Total size in memory | 20.4 MiB |
| Average record size in memory | 182.0 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 11 |
| Boolean | 6 |
Reason has constant value "TB" | Constant |
Is_year_start has constant value "False" | Constant |
| Dataset has 1096 (0.9%) duplicate rows | Duplicates |
Id has a high cardinality: 22217 distinct values | High cardinality |
Applied is highly correlated with Received and 2 other fields | High correlation |
Received is highly correlated with Applied and 2 other fields | High correlation |
logapplied is highly correlated with Applied and 2 other fields | High correlation |
logreceived is highly correlated with Applied and 2 other fields | High correlation |
Year is highly correlated with Elapsed | High correlation |
Month is highly correlated with Week and 1 other fields | High correlation |
Week is highly correlated with Month and 1 other fields | High correlation |
Dayofyear is highly correlated with Month and 1 other fields | High correlation |
Elapsed is highly correlated with Year | High correlation |
Gender is highly correlated with Is_year_start and 1 other fields | High correlation |
Is_month_end is highly correlated with Is_year_start and 1 other fields | High correlation |
True_False is highly correlated with Is_year_start and 1 other fields | High correlation |
Payment_Method is highly correlated with Is_year_start and 2 other fields | High correlation |
Location is highly correlated with Is_year_start and 1 other fields | High correlation |
Area is highly correlated with Is_year_start and 1 other fields | High correlation |
AgeGroup is highly correlated with Is_year_start and 2 other fields | High correlation |
Is_month_start is highly correlated with Is_year_start and 1 other fields | High correlation |
Is_quarter_start is highly correlated with Is_year_start and 1 other fields | High correlation |
Is_year_end is highly correlated with Is_year_start and 1 other fields | High correlation |
Is_year_start is highly correlated with Gender and 14 other fields | High correlation |
Year is highly correlated with Is_year_start and 1 other fields | High correlation |
Is_quarter_end is highly correlated with Is_year_start and 1 other fields | High correlation |
Age is highly correlated with AgeGroup and 2 other fields | High correlation |
Payment_Type is highly correlated with Payment_Method and 2 other fields | High correlation |
Reason is highly correlated with Gender and 14 other fields | High correlation |
Ratio is highly skewed (γ1 = -135.0970658) | Skewed |
Dayofweek has 21122 (18.0%) zeros | Zeros |
Reproduction
| Analysis started | 2021-04-26 18:49:36.925979 |
|---|---|
| Analysis finished | 2021-04-26 18:51:49.787225 |
| Duration | 2 minutes and 12.86 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
| Distinct | 2043 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 920.5182914 |
|---|---|
| Minimum | 1 |
| Maximum | 21060 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 917.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 134 |
| Q1 | 360 |
| median | 800 |
| Q3 | 1350 |
| 95-th percentile | 2120 |
| Maximum | 21060 |
| Range | 21059 |
| Interquartile range (IQR) | 990 |
Descriptive statistics
| Standard deviation | 645.3930638 |
|---|---|
| Coefficient of variation (CV) | 0.7011192171 |
| Kurtosis | 8.808821341 |
| Mean | 920.5182914 |
| Median Absolute Deviation (MAD) | 480 |
| Skewness | 1.081168702 |
| Sum | 108073450 |
| Variance | 416532.2068 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1000 | 2785 | 2.4% |
| 1200 | 2669 | 2.3% |
| 600 | 2612 | 2.2% |
| 210 | 2256 | 1.9% |
| 1400 | 2181 | 1.9% |
| 800 | 1821 | 1.6% |
| 500 | 1780 | 1.5% |
| 1800 | 1715 | 1.5% |
| 900 | 1547 | 1.3% |
| 400 | 1540 | 1.3% |
| Other values (2033) | 96499 |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 2 | 4 | |
| 3 | 4 | |
| 4 | 1 | < 0.1% |
| 5 | 2 |
| Value | Count | Frequency (%) |
| 21060 | 1 | |
| 11220 | 1 | |
| 6140 | 1 | |
| 6000 | 1 | |
| 4648 | 1 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 917.4 KiB |
| F | |
|---|---|
| M | |
| GD | 31 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.000264043 |
| Min length | 1 |
Characters and Unicode
| Total characters | 117436 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | F |
| 3rd row | M |
| 4th row | M |
| 5th row | F |
| Value | Count | Frequency (%) |
| F | 75095 | |
| M | 42279 | |
| GD | 31 | < 0.1% |
| Value | Count | Frequency (%) |
| f | 75095 | |
| m | 42279 | |
| gd | 31 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 75095 | |
| M | 42279 | |
| G | 31 | < 0.1% |
| D | 31 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 117436 |
Most frequent character per category
| Value | Count | Frequency (%) |
| F | 75095 | |
| M | 42279 | |
| G | 31 | < 0.1% |
| D | 31 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 117436 |
Most frequent character per script
| Value | Count | Frequency (%) |
| F | 75095 | |
| M | 42279 | |
| G | 31 | < 0.1% |
| D | 31 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 117436 |
Most frequent character per block
| Value | Count | Frequency (%) |
| F | 75095 | |
| M | 42279 | |
| G | 31 | < 0.1% |
| D | 31 | < 0.1% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 917.4 KiB |
| AV | |
|---|---|
| RP | |
| U | 1 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.999991482 |
| Min length | 1 |
Characters and Unicode
| Total characters | 234809 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | AV |
|---|---|
| 2nd row | RP |
| 3rd row | AV |
| 4th row | AV |
| 5th row | AV |
| Value | Count | Frequency (%) |
| AV | 100828 | |
| RP | 16576 | 14.1% |
| U | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| av | 100828 | |
| rp | 16576 | 14.1% |
| u | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 100828 | |
| V | 100828 | |
| R | 16576 | 7.1% |
| P | 16576 | 7.1% |
| U | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 234809 |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 100828 | |
| V | 100828 | |
| R | 16576 | 7.1% |
| P | 16576 | 7.1% |
| U | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 234809 |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 100828 | |
| V | 100828 | |
| R | 16576 | 7.1% |
| P | 16576 | 7.1% |
| U | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 234809 |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 100828 | |
| V | 100828 | |
| R | 16576 | 7.1% |
| P | 16576 | 7.1% |
| U | 1 | < 0.1% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 917.4 KiB |
| M | |
|---|---|
| NE | |
| O | |
| PP | |
| U | 4247 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.410936502 |
| Min length | 1 |
Characters and Unicode
| Total characters | 165651 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | NE |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
| Value | Count | Frequency (%) |
| M | 52406 | |
| NE | 36752 | |
| O | 12506 | 10.7% |
| PP | 11494 | 9.8% |
| U | 4247 | 3.6% |
| Value | Count | Frequency (%) |
| m | 52406 | |
| ne | 36752 | |
| o | 12506 | 10.7% |
| pp | 11494 | 9.8% |
| u | 4247 | 3.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 52406 | |
| N | 36752 | |
| E | 36752 | |
| P | 22988 | |
| O | 12506 | 7.5% |
| U | 4247 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 165651 |
Most frequent character per category
| Value | Count | Frequency (%) |
| M | 52406 | |
| N | 36752 | |
| E | 36752 | |
| P | 22988 | |
| O | 12506 | 7.5% |
| U | 4247 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 165651 |
Most frequent character per script
| Value | Count | Frequency (%) |
| M | 52406 | |
| N | 36752 | |
| E | 36752 | |
| P | 22988 | |
| O | 12506 | 7.5% |
| U | 4247 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 165651 |
Most frequent character per block
| Value | Count | Frequency (%) |
| M | 52406 | |
| N | 36752 | |
| E | 36752 | |
| P | 22988 | |
| O | 12506 | 7.5% |
| U | 4247 | 2.6% |
| Distinct | 3645 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 920.5135505 |
|---|---|
| Minimum | 1 |
| Maximum | 21060 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 917.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 134 |
| Q1 | 360 |
| median | 800 |
| Q3 | 1350 |
| 95-th percentile | 2120 |
| Maximum | 21060 |
| Range | 21059 |
| Interquartile range (IQR) | 990 |
Descriptive statistics
| Standard deviation | 645.3905099 |
|---|---|
| Coefficient of variation (CV) | 0.7011200536 |
| Kurtosis | 8.808916907 |
| Mean | 920.5135505 |
| Median Absolute Deviation (MAD) | 480 |
| Skewness | 1.081163686 |
| Sum | 108072893.4 |
| Variance | 416528.9102 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1000 | 2785 | 2.4% |
| 1200 | 2669 | 2.3% |
| 600 | 2612 | 2.2% |
| 210 | 2255 | 1.9% |
| 1400 | 2181 | 1.9% |
| 800 | 1819 | 1.5% |
| 500 | 1775 | 1.5% |
| 1800 | 1715 | 1.5% |
| 900 | 1547 | 1.3% |
| 400 | 1540 | 1.3% |
| Other values (3635) | 96507 |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 2 | 3 | |
| 2.1 | 1 | < 0.1% |
| 2.5 | 2 | |
| 2.58 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 21060 | 1 | |
| 11220 | 1 | |
| 6140 | 1 | |
| 6000 | 1 | |
| 4647.9 | 1 |
| Distinct | 22217 |
|---|---|
| Distinct (%) | 18.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 917.4 KiB |
| GHI000112669 | |
|---|---|
| GHI000753413 | 1149 |
| GHI001206283 | 986 |
| GHI000143648 | 768 |
| GHI000134418 | 576 |
| Other values (22212) |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Characters and Unicode
| Total characters | 1408860 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 14612 ? |
|---|---|
| Unique (%) | 12.4% |
Sample
| 1st row | GHI000112669 |
|---|---|
| 2nd row | GHI000780038 |
| 3rd row | GHI000437510 |
| 4th row | GHI000140582 |
| 5th row | GHI001420547 |
| Value | Count | Frequency (%) |
| GHI000112669 | 11022 | 9.4% |
| GHI000753413 | 1149 | 1.0% |
| GHI001206283 | 986 | 0.8% |
| GHI000143648 | 768 | 0.7% |
| GHI000134418 | 576 | 0.5% |
| GHI000437510 | 552 | 0.5% |
| GHI001853440 | 549 | 0.5% |
| GHI001086470 | 541 | 0.5% |
| GHI000086558 | 538 | 0.5% |
| GHI000100619 | 526 | 0.4% |
| Other values (22207) | 100198 |
| Value | Count | Frequency (%) |
| ghi000112669 | 11022 | 9.4% |
| ghi000753413 | 1149 | 1.0% |
| ghi001206283 | 986 | 0.8% |
| ghi000143648 | 768 | 0.7% |
| ghi000134418 | 576 | 0.5% |
| ghi000437510 | 552 | 0.5% |
| ghi001853440 | 549 | 0.5% |
| ghi001086470 | 541 | 0.5% |
| ghi000086558 | 538 | 0.5% |
| ghi000100619 | 526 | 0.4% |
| Other values (22207) | 100198 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 392591 | |
| 1 | 130709 | 9.3% |
| G | 117405 | 8.3% |
| H | 117405 | 8.3% |
| I | 117405 | 8.3% |
| 6 | 81309 | 5.8% |
| 2 | 70778 | 5.0% |
| 9 | 68849 | 4.9% |
| 4 | 67571 | 4.8% |
| 8 | 63924 | 4.5% |
| Other values (3) | 180914 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1056645 | |
| Uppercase Letter | 352215 | 25.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 392591 | |
| 1 | 130709 | 12.4% |
| 6 | 81309 | 7.7% |
| 2 | 70778 | 6.7% |
| 9 | 68849 | 6.5% |
| 4 | 67571 | 6.4% |
| 8 | 63924 | 6.0% |
| 3 | 61966 | 5.9% |
| 5 | 61074 | 5.8% |
| 7 | 57874 | 5.5% |
| Value | Count | Frequency (%) |
| G | 117405 | |
| H | 117405 | |
| I | 117405 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1056645 | |
| Latin | 352215 | 25.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 392591 | |
| 1 | 130709 | 12.4% |
| 6 | 81309 | 7.7% |
| 2 | 70778 | 6.7% |
| 9 | 68849 | 6.5% |
| 4 | 67571 | 6.4% |
| 8 | 63924 | 6.0% |
| 3 | 61966 | 5.9% |
| 5 | 61074 | 5.8% |
| 7 | 57874 | 5.5% |
| Value | Count | Frequency (%) |
| G | 117405 | |
| H | 117405 | |
| I | 117405 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1408860 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 392591 | |
| 1 | 130709 | 9.3% |
| G | 117405 | 8.3% |
| H | 117405 | 8.3% |
| I | 117405 | 8.3% |
| 6 | 81309 | 5.8% |
| 2 | 70778 | 5.0% |
| 9 | 68849 | 4.9% |
| 4 | 67571 | 4.8% |
| 8 | 63924 | 4.5% |
| Other values (3) | 180914 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 917.4 KiB |
| TB |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 234810 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TB |
|---|---|
| 2nd row | TB |
| 3rd row | TB |
| 4th row | TB |
| 5th row | TB |
| Value | Count | Frequency (%) |
| TB | 117405 |
| Value | Count | Frequency (%) |
| tb | 117405 |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 117405 | |
| B | 117405 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 234810 |
Most frequent character per category
| Value | Count | Frequency (%) |
| T | 117405 | |
| B | 117405 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 234810 |
Most frequent character per script
| Value | Count | Frequency (%) |
| T | 117405 | |
| B | 117405 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 234810 |
Most frequent character per block
| Value | Count | Frequency (%) |
| T | 117405 | |
| B | 117405 |
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 917.4 KiB |
| 25-29 | |
|---|---|
| 20-24 | |
| 30-34 | |
| 35-39 | |
| 40-44 | |
| Other values (8) |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.885268941 |
| Min length | 2 |
Characters and Unicode
| Total characters | 573555 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 40-44 |
|---|---|
| 2nd row | 18-19 |
| 3rd row | 35-39 |
| 4th row | 55-59 |
| 5th row | 30-34 |
| Value | Count | Frequency (%) |
| 25-29 | 21016 | |
| 20-24 | 18587 | |
| 30-34 | 16859 | |
| 35-39 | 12991 | |
| 40-44 | 10506 | |
| 45-49 | 9398 | |
| 50-54 | 7349 | 6.3% |
| 65+ | 5973 | 5.1% |
| 55-59 | 5690 | 4.8% |
| 18-19 | 4414 | 3.8% |
| Other values (3) | 4622 | 3.9% |
| Value | Count | Frequency (%) |
| 25-29 | 21016 | |
| 20-24 | 18587 | |
| 30-34 | 16859 | |
| 35-39 | 12991 | |
| 40-44 | 10506 | |
| 45-49 | 9398 | |
| 50-54 | 7349 | 6.3% |
| 65 | 5973 | 5.1% |
| 55-59 | 5690 | 4.8% |
| 18-19 | 4414 | 3.8% |
| Other values (3) | 4622 | 3.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 110924 | |
| 4 | 97223 | |
| 5 | 81146 | |
| 2 | 79206 | |
| 3 | 59700 | |
| 0 | 57415 | |
| 9 | 53509 | |
| 6 | 14299 | 2.5% |
| 1 | 9336 | 1.6% |
| + | 5973 | 1.0% |
| Other values (2) | 4824 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 456658 | |
| Dash Punctuation | 110924 | 19.3% |
| Math Symbol | 5973 | 1.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 4 | 97223 | |
| 5 | 81146 | |
| 2 | 79206 | |
| 3 | 59700 | |
| 0 | 57415 | |
| 9 | 53509 | |
| 6 | 14299 | 3.1% |
| 1 | 9336 | 2.0% |
| 8 | 4414 | 1.0% |
| 7 | 410 | 0.1% |
| Value | Count | Frequency (%) |
| - | 110924 |
| Value | Count | Frequency (%) |
| + | 5973 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 573555 |
Most frequent character per script
| Value | Count | Frequency (%) |
| - | 110924 | |
| 4 | 97223 | |
| 5 | 81146 | |
| 2 | 79206 | |
| 3 | 59700 | |
| 0 | 57415 | |
| 9 | 53509 | |
| 6 | 14299 | 2.5% |
| 1 | 9336 | 1.6% |
| + | 5973 | 1.0% |
| Other values (2) | 4824 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 573555 |
Most frequent character per block
| Value | Count | Frequency (%) |
| - | 110924 | |
| 4 | 97223 | |
| 5 | 81146 | |
| 2 | 79206 | |
| 3 | 59700 | |
| 0 | 57415 | |
| 9 | 53509 | |
| 6 | 14299 | 2.5% |
| 1 | 9336 | 1.6% |
| + | 5973 | 1.0% |
| Other values (2) | 4824 | 0.8% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 917.4 KiB |
| AM | |
|---|---|
| O | |
| C | |
| BP | |
| W | |
| Other values (6) |
Length
| Max length | 3 |
|---|---|
| Median length | 1 |
| Mean length | 1.505353264 |
| Min length | 1 |
Characters and Unicode
| Total characters | 176736 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | O |
|---|---|
| 2nd row | Wlg |
| 3rd row | AM |
| 4th row | C |
| 5th row | C |
| Value | Count | Frequency (%) |
| AM | 32518 | |
| O | 26530 | |
| C | 14173 | |
| BP | 8627 | 7.3% |
| W | 7766 | 6.6% |
| T | 6066 | 5.2% |
| S | 6012 | 5.1% |
| Wlg | 5238 | 4.5% |
| EC | 3958 | 3.4% |
| NL | 3752 | 3.2% |
| Value | Count | Frequency (%) |
| am | 32518 | |
| o | 26530 | |
| c | 14173 | |
| bp | 8627 | 7.3% |
| w | 7766 | 6.6% |
| t | 6066 | 5.2% |
| s | 6012 | 5.1% |
| wlg | 5238 | 4.5% |
| ec | 3958 | 3.4% |
| nl | 3752 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 32518 | |
| M | 32518 | |
| O | 26530 | |
| C | 18131 | |
| W | 13004 | 7.4% |
| B | 8627 | 4.9% |
| P | 8627 | 4.9% |
| N | 6517 | 3.7% |
| T | 6066 | 3.4% |
| S | 6012 | 3.4% |
| Other values (4) | 18186 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 166260 | |
| Lowercase Letter | 10476 | 5.9% |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 32518 | |
| M | 32518 | |
| O | 26530 | |
| C | 18131 | |
| W | 13004 | 7.8% |
| B | 8627 | 5.2% |
| P | 8627 | 5.2% |
| N | 6517 | 3.9% |
| T | 6066 | 3.6% |
| S | 6012 | 3.6% |
| Other values (2) | 7710 | 4.6% |
| Value | Count | Frequency (%) |
| l | 5238 | |
| g | 5238 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 176736 |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 32518 | |
| M | 32518 | |
| O | 26530 | |
| C | 18131 | |
| W | 13004 | 7.4% |
| B | 8627 | 4.9% |
| P | 8627 | 4.9% |
| N | 6517 | 3.7% |
| T | 6066 | 3.4% |
| S | 6012 | 3.4% |
| Other values (4) | 18186 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 176736 |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 32518 | |
| M | 32518 | |
| O | 26530 | |
| C | 18131 | |
| W | 13004 | 7.4% |
| B | 8627 | 4.9% |
| P | 8627 | 4.9% |
| N | 6517 | 3.7% |
| T | 6066 | 3.4% |
| S | 6012 | 3.4% |
| Other values (4) | 18186 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 917.4 KiB |
| 0 | |
|---|---|
| 1 | 334 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 117405 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 117071 | |
| 1 | 334 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 117071 | |
| 1 | 334 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 117071 | |
| 1 | 334 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 117405 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 117071 | |
| 1 | 334 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 117405 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 117071 | |
| 1 | 334 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 117405 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 117071 | |
| 1 | 334 | 0.3% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 917.4 KiB |
| MidAge | |
|---|---|
| Adult | |
| Old | |
| Teenage | 508 |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 5.038482177 |
| Min length | 3 |
Characters and Unicode
| Total characters | 591543 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MidAge |
|---|---|
| 2nd row | Adult |
| 3rd row | MidAge |
| 4th row | Old |
| 5th row | MidAge |
| Value | Count | Frequency (%) |
| MidAge | 49754 | |
| Adult | 44017 | |
| Old | 23126 | |
| Teenage | 508 | 0.4% |
| Value | Count | Frequency (%) |
| midage | 49754 | |
| adult | 44017 | |
| old | 23126 | |
| teenage | 508 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 116897 | |
| A | 93771 | |
| l | 67143 | |
| e | 51278 | |
| g | 50262 | |
| M | 49754 | |
| i | 49754 | |
| u | 44017 | 7.4% |
| t | 44017 | 7.4% |
| O | 23126 | 3.9% |
| Other values (3) | 1524 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 424384 | |
| Uppercase Letter | 167159 | 28.3% |
Most frequent character per category
| Value | Count | Frequency (%) |
| d | 116897 | |
| l | 67143 | |
| e | 51278 | |
| g | 50262 | |
| i | 49754 | |
| u | 44017 | 10.4% |
| t | 44017 | 10.4% |
| n | 508 | 0.1% |
| a | 508 | 0.1% |
| Value | Count | Frequency (%) |
| A | 93771 | |
| M | 49754 | |
| O | 23126 | 13.8% |
| T | 508 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 591543 |
Most frequent character per script
| Value | Count | Frequency (%) |
| d | 116897 | |
| A | 93771 | |
| l | 67143 | |
| e | 51278 | |
| g | 50262 | |
| M | 49754 | |
| i | 49754 | |
| u | 44017 | 7.4% |
| t | 44017 | 7.4% |
| O | 23126 | 3.9% |
| Other values (3) | 1524 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 591543 |
Most frequent character per block
| Value | Count | Frequency (%) |
| d | 116897 | |
| A | 93771 | |
| l | 67143 | |
| e | 51278 | |
| g | 50262 | |
| M | 49754 | |
| i | 49754 | |
| u | 44017 | 7.4% |
| t | 44017 | 7.4% |
| O | 23126 | 3.9% |
| Other values (3) | 1524 | 0.3% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 917.4 KiB |
| AV | |
|---|---|
| RPU |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.141195009 |
| Min length | 2 |
Characters and Unicode
| Total characters | 251387 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AV |
|---|---|
| 2nd row | RPU |
| 3rd row | AV |
| 4th row | AV |
| 5th row | AV |
| Value | Count | Frequency (%) |
| AV | 100828 | |
| RPU | 16577 | 14.1% |
| Value | Count | Frequency (%) |
| av | 100828 | |
| rpu | 16577 | 14.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 100828 | |
| V | 100828 | |
| R | 16577 | 6.6% |
| P | 16577 | 6.6% |
| U | 16577 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 251387 |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 100828 | |
| V | 100828 | |
| R | 16577 | 6.6% |
| P | 16577 | 6.6% |
| U | 16577 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 251387 |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 100828 | |
| V | 100828 | |
| R | 16577 | 6.6% |
| P | 16577 | 6.6% |
| U | 16577 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 251387 |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 100828 | |
| V | 100828 | |
| R | 16577 | 6.6% |
| P | 16577 | 6.6% |
| U | 16577 | 6.6% |
| Distinct | 2043 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.520006166 |
|---|---|
| Minimum | 0 |
| Maximum | 9.955130786 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 917.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4.8978398 |
| Q1 | 5.886104031 |
| median | 6.684611728 |
| Q3 | 7.207859871 |
| 95-th percentile | 7.659171368 |
| Maximum | 9.955130786 |
| Range | 9.955130786 |
| Interquartile range (IQR) | 1.32175584 |
Descriptive statistics
| Standard deviation | 0.8599657331 |
|---|---|
| Coefficient of variation (CV) | 0.1318964601 |
| Kurtosis | -0.1932573719 |
| Mean | 6.520006166 |
| Median Absolute Deviation (MAD) | 0.5947071077 |
| Skewness | -0.6108255283 |
| Sum | 765481.3239 |
| Variance | 0.7395410621 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.907755279 | 2785 | 2.4% |
| 7.090076836 | 2669 | 2.3% |
| 6.396929655 | 2612 | 2.2% |
| 5.347107531 | 2256 | 1.9% |
| 7.244227516 | 2181 | 1.9% |
| 6.684611728 | 1821 | 1.6% |
| 6.214608098 | 1780 | 1.5% |
| 7.495541944 | 1715 | 1.5% |
| 6.802394763 | 1547 | 1.3% |
| 5.991464547 | 1540 | 1.3% |
| Other values (2033) | 96499 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 0.6931471806 | 4 | |
| 1.098612289 | 4 | |
| 1.386294361 | 1 | < 0.1% |
| 1.609437912 | 2 |
| Value | Count | Frequency (%) |
| 9.955130786 | 1 | |
| 9.325453179 | 1 | |
| 8.722580021 | 1 | |
| 8.699514748 | 1 | |
| 8.444192299 | 1 |
| Distinct | 3645 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.519994709 |
|---|---|
| Minimum | 0 |
| Maximum | 9.955130786 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 917.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4.8978398 |
| Q1 | 5.886104031 |
| median | 6.684611728 |
| Q3 | 7.207859871 |
| 95-th percentile | 7.659171368 |
| Maximum | 9.955130786 |
| Range | 9.955130786 |
| Interquartile range (IQR) | 1.32175584 |
Descriptive statistics
| Standard deviation | 0.8599985062 |
|---|---|
| Coefficient of variation (CV) | 0.1319017184 |
| Kurtosis | -0.1883908053 |
| Mean | 6.519994709 |
| Median Absolute Deviation (MAD) | 0.5947071077 |
| Skewness | -0.6113915001 |
| Sum | 765479.9788 |
| Variance | 0.7395974306 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.907755279 | 2785 | 2.4% |
| 7.090076836 | 2669 | 2.3% |
| 6.396929655 | 2612 | 2.2% |
| 5.347107531 | 2255 | 1.9% |
| 7.244227516 | 2181 | 1.9% |
| 6.684611728 | 1819 | 1.5% |
| 6.214608098 | 1775 | 1.5% |
| 7.495541944 | 1715 | 1.5% |
| 6.802394763 | 1547 | 1.3% |
| 5.991464547 | 1540 | 1.3% |
| Other values (3635) | 96507 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 0.6931471806 | 3 | |
| 0.7419373447 | 1 | < 0.1% |
| 0.9162907319 | 2 | |
| 0.9477893989 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.955130786 | 1 | |
| 9.325453179 | 1 | |
| 8.722580021 | 1 | |
| 8.699514748 | 1 | |
| 8.444170784 | 1 |
| Distinct | 2012 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9999890163 |
|---|---|
| Minimum | 0.8333333333 |
| Maximum | 1.06 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 917.4 KiB |
Quantile statistics
| Minimum | 0.8333333333 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1.06 |
| Range | 0.2266666667 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.0009291046131 |
|---|---|
| Coefficient of variation (CV) | 0.0009291148182 |
| Kurtosis | 23431.76258 |
| Mean | 0.9999890163 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -135.0970658 |
| Sum | 117403.7105 |
| Variance | 8.632353821 × 107 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 113214 | |
| 1.000068695 | 93 | 0.1% |
| 1.000149701 | 67 | 0.1% |
| 1.000008885 | 63 | 0.1% |
| 0.9999086758 | 42 | < 0.1% |
| 0.9998193642 | 35 | < 0.1% |
| 1.000292553 | 32 | < 0.1% |
| 1.000212443 | 31 | < 0.1% |
| 0.9997842968 | 30 | < 0.1% |
| 0.9997647059 | 29 | < 0.1% |
| Other values (2002) | 3769 | 3.2% |
| Value | Count | Frequency (%) |
| 0.8333333333 | 2 | |
| 0.86 | 1 | |
| 0.9 | 1 | |
| 0.9627272727 | 1 | |
| 0.9725 | 1 |
| Value | Count | Frequency (%) |
| 1.06 | 1 | |
| 1.05 | 1 | |
| 1.03 | 1 | |
| 1.02 | 1 | |
| 1.015652174 | 1 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 917.4 KiB |
| 2018 | |
|---|---|
| 2019 | |
| 2017 | |
| 2020 | |
| 2016 | 2407 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 469620 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2018 |
|---|---|
| 2nd row | 2018 |
| 3rd row | 2019 |
| 4th row | 2019 |
| 5th row | 2020 |
| Value | Count | Frequency (%) |
| 2018 | 29598 | |
| 2019 | 29329 | |
| 2017 | 29090 | |
| 2020 | 26981 | |
| 2016 | 2407 | 2.1% |
| Value | Count | Frequency (%) |
| 2018 | 29598 | |
| 2019 | 29329 | |
| 2017 | 29090 | |
| 2020 | 26981 | |
| 2016 | 2407 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 144386 | |
| 0 | 144386 | |
| 1 | 90424 | |
| 8 | 29598 | 6.3% |
| 9 | 29329 | 6.2% |
| 7 | 29090 | 6.2% |
| 6 | 2407 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 469620 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 2 | 144386 | |
| 0 | 144386 | |
| 1 | 90424 | |
| 8 | 29598 | 6.3% |
| 9 | 29329 | 6.2% |
| 7 | 29090 | 6.2% |
| 6 | 2407 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 469620 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 2 | 144386 | |
| 0 | 144386 | |
| 1 | 90424 | |
| 8 | 29598 | 6.3% |
| 9 | 29329 | 6.2% |
| 7 | 29090 | 6.2% |
| 6 | 2407 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 469620 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 2 | 144386 | |
| 0 | 144386 | |
| 1 | 90424 | |
| 8 | 29598 | 6.3% |
| 9 | 29329 | 6.2% |
| 7 | 29090 | 6.2% |
| 6 | 2407 | 0.5% |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.637911503 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 917.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.401530338 |
|---|---|
| Coefficient of variation (CV) | 0.5124398444 |
| Kurtosis | -1.178075211 |
| Mean | 6.637911503 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.07024217255 |
| Sum | 779324 |
| Variance | 11.57040864 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 10905 | |
| 7 | 10746 | |
| 11 | 10512 | |
| 5 | 10287 | |
| 9 | 10108 | |
| 3 | 10042 | |
| 6 | 10015 | |
| 10 | 9948 | |
| 2 | 9535 | |
| 12 | 9389 | |
| Other values (2) | 15918 |
| Value | Count | Frequency (%) |
| 1 | 8761 | |
| 2 | 9535 | |
| 3 | 10042 | |
| 4 | 7157 | |
| 5 | 10287 |
| Value | Count | Frequency (%) |
| 12 | 9389 | |
| 11 | 10512 | |
| 10 | 9948 | |
| 9 | 10108 | |
| 8 | 10905 |
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.19057962 |
|---|---|
| Minimum | 1 |
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 917.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 14 |
| median | 28 |
| Q3 | 40 |
| 95-th percentile | 50 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 14.78431431 |
|---|---|
| Coefficient of variation (CV) | 0.5437292811 |
| Kurtosis | -1.194923362 |
| Mean | 27.19057962 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.06562574263 |
| Sum | 3192310 |
| Variance | 218.5759496 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 48 | 2822 | 2.4% |
| 49 | 2713 | 2.3% |
| 51 | 2673 | 2.3% |
| 27 | 2539 | 2.2% |
| 8 | 2507 | 2.1% |
| 26 | 2502 | 2.1% |
| 24 | 2499 | 2.1% |
| 50 | 2496 | 2.1% |
| 33 | 2496 | 2.1% |
| 9 | 2492 | 2.1% |
| Other values (42) | 91666 |
| Value | Count | Frequency (%) |
| 1 | 895 | 0.8% |
| 2 | 1942 | |
| 3 | 2386 | |
| 4 | 2363 | |
| 5 | 2252 |
| Value | Count | Frequency (%) |
| 52 | 1073 | 0.9% |
| 51 | 2673 | |
| 50 | 2496 | |
| 49 | 2713 | |
| 48 | 2822 |
Day
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.83495592 |
|---|---|
| Minimum | 1 |
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 917.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.663837733 |
|---|---|
| Coefficient of variation (CV) | 0.5471336817 |
| Kurtosis | -1.153643307 |
| Mean | 15.83495592 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.007585843684 |
| Sum | 1859103 |
| Variance | 75.06208426 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 13 | 4372 | 3.7% |
| 20 | 4355 | 3.7% |
| 19 | 4277 | 3.6% |
| 11 | 4097 | 3.5% |
| 12 | 4065 | 3.5% |
| 21 | 4049 | 3.4% |
| 27 | 4004 | 3.4% |
| 16 | 3983 | 3.4% |
| 24 | 3970 | 3.4% |
| 5 | 3967 | 3.4% |
| Other values (21) | 76266 |
| Value | Count | Frequency (%) |
| 1 | 3433 | |
| 2 | 3446 | |
| 3 | 3557 | |
| 4 | 3636 | |
| 5 | 3967 |
| Value | Count | Frequency (%) |
| 31 | 2417 | |
| 30 | 3441 | |
| 29 | 3481 | |
| 28 | 3795 | |
| 27 | 4004 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.104731485 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 21122 |
| Zeros (%) | 18.0% |
| Memory size | 917.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.42529502 |
|---|---|
| Coefficient of variation (CV) | 0.677186154 |
| Kurtosis | -1.262712993 |
| Mean | 2.104731485 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.05894428997 |
| Sum | 247106 |
| Variance | 2.031465893 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 25467 | |
| 3 | 24368 | |
| 2 | 22923 | |
| 1 | 22835 | |
| 0 | 21122 | |
| 5 | 687 | 0.6% |
| 6 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 21122 | |
| 1 | 22835 | |
| 2 | 22923 | |
| 3 | 24368 | |
| 4 | 25467 |
| Value | Count | Frequency (%) |
| 6 | 3 | < 0.1% |
| 5 | 687 | 0.6% |
| 4 | 25467 | |
| 3 | 24368 | |
| 2 | 22923 |
| Distinct | 360 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 186.7331204 |
|---|---|
| Minimum | 3 |
| Maximum | 365 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 917.4 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 95 |
| median | 190 |
| Q3 | 275 |
| 95-th percentile | 345 |
| Maximum | 365 |
| Range | 362 |
| Interquartile range (IQR) | 180 |
Descriptive statistics
| Standard deviation | 103.5740564 |
|---|---|
| Coefficient of variation (CV) | 0.5546635548 |
| Kurtosis | -1.18949385 |
| Mean | 186.7331204 |
| Median Absolute Deviation (MAD) | 90 |
| Skewness | -0.06003403236 |
| Sum | 21923402 |
| Variance | 10727.58515 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 213 | 544 | 0.5% |
| 354 | 538 | 0.5% |
| 31 | 536 | 0.5% |
| 340 | 518 | 0.4% |
| 170 | 517 | 0.4% |
| 233 | 516 | 0.4% |
| 184 | 516 | 0.4% |
| 318 | 515 | 0.4% |
| 24 | 513 | 0.4% |
| 52 | 512 | 0.4% |
| Other values (350) | 112180 |
| Value | Count | Frequency (%) |
| 3 | 189 | |
| 4 | 228 | |
| 5 | 161 | |
| 6 | 165 | |
| 7 | 192 |
| Value | Count | Frequency (%) |
| 365 | 216 | |
| 364 | 152 | |
| 363 | 157 | |
| 362 | 186 | |
| 361 | 245 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 114.8 KiB |
| False | |
|---|---|
| True | 4023 |
| Value | Count | Frequency (%) |
| False | 113382 | |
| True | 4023 | 3.4% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 114.8 KiB |
| False | |
|---|---|
| True | 3433 |
| Value | Count | Frequency (%) |
| False | 113972 | |
| True | 3433 | 2.9% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 114.8 KiB |
| False | |
|---|---|
| True | 849 |
| Value | Count | Frequency (%) |
| False | 116556 | |
| True | 849 | 0.7% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 114.8 KiB |
| False | |
|---|---|
| True | 706 |
| Value | Count | Frequency (%) |
| False | 116699 | |
| True | 706 | 0.6% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 114.8 KiB |
| False | |
|---|---|
| True | 145 |
| Value | Count | Frequency (%) |
| False | 117260 | |
| True | 145 | 0.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 114.8 KiB |
| False |
|---|
| Value | Count | Frequency (%) |
| False | 117405 |
| Distinct | 1118 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1544076146 |
|---|---|
| Minimum | 1480550400 |
| Maximum | 1606694400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 917.4 KiB |
Quantile statistics
| Minimum | 1480550400 |
|---|---|
| 5-th percentile | 1487203200 |
| Q1 | 1512345600 |
| median | 1543536000 |
| Q3 | 1574985600 |
| 95-th percentile | 1601337600 |
| Maximum | 1606694400 |
| Range | 126144000 |
| Interquartile range (IQR) | 62640000 |
Descriptive statistics
| Standard deviation | 36653780.4 |
|---|---|
| Coefficient of variation (CV) | 0.02373832436 |
| Kurtosis | -1.188244809 |
| Mean | 1544076146 |
| Median Absolute Deviation (MAD) | 31449600 |
| Skewness | 0.01692927592 |
| Sum | 1.8128226 × 1014 |
| Variance | 1.343499618 × 1015 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1487203200 | 185 | 0.2% |
| 1486512000 | 179 | 0.2% |
| 1487289600 | 168 | 0.1% |
| 1513209600 | 167 | 0.1% |
| 1596153600 | 166 | 0.1% |
| 1481068800 | 162 | 0.1% |
| 1486080000 | 161 | 0.1% |
| 1606348800 | 160 | 0.1% |
| 1597276800 | 159 | 0.1% |
| 1529020800 | 159 | 0.1% |
| Other values (1108) | 115739 |
| Value | Count | Frequency (%) |
| 1480550400 | 138 | |
| 1480636800 | 109 | |
| 1480896000 | 98 | |
| 1480982400 | 114 | |
| 1481068800 | 162 |
| Value | Count | Frequency (%) |
| 1606694400 | 146 | |
| 1606521600 | 13 | < 0.1% |
| 1606435200 | 153 | |
| 1606348800 | 160 | |
| 1606262400 | 135 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| Applied | Gender | Payment_Method | Location | Received | Id | Reason | Age | Area | True_False | AgeGroup | Payment_Type | logapplied | logreceived | Ratio | Year | Month | Week | Day | Dayofweek | Dayofyear | Is_month_end | Is_month_start | Is_quarter_end | Is_quarter_start | Is_year_end | Is_year_start | Elapsed | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 134.0 | F | AV | M | 134.0 | GHI000112669 | TB | 40-44 | O | 0 | MidAge | AV | 4.897840 | 4.897840 | 1.0 | 2018 | 6 | 26 | 25 | 0 | 176 | False | False | False | False | False | False | 1529884800 |
| 1 | 800.0 | F | RP | NE | 800.0 | GHI000780038 | TB | 18-19 | Wlg | 0 | Adult | RPU | 6.684612 | 6.684612 | 1.0 | 2018 | 11 | 48 | 30 | 4 | 334 | True | False | False | False | False | False | 1543536000 |
| 2 | 270.0 | M | AV | M | 270.0 | GHI000437510 | TB | 35-39 | AM | 0 | MidAge | AV | 5.598422 | 5.598422 | 1.0 | 2019 | 3 | 10 | 5 | 1 | 64 | False | False | False | False | False | False | 1551744000 |
| 3 | 600.0 | M | AV | M | 600.0 | GHI000140582 | TB | 55-59 | C | 0 | Old | AV | 6.396930 | 6.396930 | 1.0 | 2019 | 1 | 5 | 31 | 3 | 31 | True | False | False | False | False | False | 1548892800 |
| 4 | 270.0 | F | AV | M | 270.0 | GHI001420547 | TB | 30-34 | C | 0 | MidAge | AV | 5.598422 | 5.598422 | 1.0 | 2020 | 9 | 37 | 9 | 2 | 253 | False | False | False | False | False | False | 1599609600 |
| 5 | 1360.0 | F | RP | PP | 1360.0 | GHI000096091 | TB | 25-29 | EC | 0 | Adult | RPU | 7.215240 | 7.215240 | 1.0 | 2017 | 9 | 36 | 8 | 4 | 251 | False | False | False | False | False | False | 1504828800 |
| 6 | 2020.0 | F | AV | PP | 2020.0 | GHI000157538 | TB | 35-39 | AM | 0 | MidAge | AV | 7.610853 | 7.610853 | 1.0 | 2019 | 5 | 20 | 14 | 1 | 134 | False | False | False | False | False | False | 1557792000 |
| 7 | 160.0 | M | AV | NE | 160.0 | GHI000134666 | TB | 40-44 | S | 0 | MidAge | AV | 5.075174 | 5.075174 | 1.0 | 2017 | 7 | 28 | 11 | 1 | 192 | False | False | False | False | False | False | 1499731200 |
| 8 | 960.0 | M | AV | M | 960.0 | GHI000210735 | TB | 45-49 | T | 0 | MidAge | AV | 6.866933 | 6.866933 | 1.0 | 2018 | 10 | 41 | 8 | 0 | 281 | False | False | False | False | False | False | 1538956800 |
| 9 | 400.0 | F | AV | M | 400.0 | GHI001182494 | TB | 20-24 | C | 0 | Adult | AV | 5.991465 | 5.991465 | 1.0 | 2017 | 11 | 47 | 20 | 0 | 324 | False | False | False | False | False | False | 1511136000 |
Last rows
| Applied | Gender | Payment_Method | Location | Received | Id | Reason | Age | Area | True_False | AgeGroup | Payment_Type | logapplied | logreceived | Ratio | Year | Month | Week | Day | Dayofweek | Dayofyear | Is_month_end | Is_month_start | Is_quarter_end | Is_quarter_start | Is_year_end | Is_year_start | Elapsed | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 117395 | 1800.0 | F | AV | U | 1800.0 | GHI000609805 | TB | 20-24 | AM | 0 | Adult | AV | 7.495542 | 7.495542 | 1.0 | 2020 | 3 | 12 | 20 | 4 | 80 | False | False | False | False | False | False | 1584662400 |
| 117396 | 1200.0 | F | AV | O | 1200.0 | GHI000327936 | TB | 20-24 | S | 0 | Adult | AV | 7.090077 | 7.090077 | 1.0 | 2018 | 10 | 43 | 26 | 4 | 299 | False | False | False | False | False | False | 1540512000 |
| 117397 | 1490.0 | F | AV | M | 1490.0 | GHI000886793 | TB | 30-34 | BP | 0 | MidAge | AV | 7.306531 | 7.306531 | 1.0 | 2017 | 1 | 3 | 18 | 2 | 18 | False | False | False | False | False | False | 1484697600 |
| 117398 | 880.0 | F | AV | NE | 880.0 | GHI000431616 | TB | 50-54 | T | 0 | Old | AV | 6.779922 | 6.779922 | 1.0 | 2018 | 7 | 28 | 13 | 4 | 194 | False | False | False | False | False | False | 1531440000 |
| 117399 | 1000.0 | F | AV | M | 1000.0 | GHI000102929 | TB | 65+ | O | 0 | Old | AV | 6.907755 | 6.907755 | 1.0 | 2018 | 2 | 6 | 7 | 2 | 38 | False | False | False | False | False | False | 1517961600 |
| 117400 | 1050.0 | F | AV | NE | 1050.0 | GHI000263441 | TB | 45-49 | BP | 0 | MidAge | AV | 6.956545 | 6.956545 | 1.0 | 2017 | 3 | 10 | 6 | 0 | 65 | False | False | False | False | False | False | 1488758400 |
| 117401 | 680.0 | M | RP | M | 680.0 | GHI000142341 | TB | 25-29 | C | 0 | Adult | RPU | 6.522093 | 6.522093 | 1.0 | 2017 | 8 | 35 | 28 | 0 | 240 | False | False | False | False | False | False | 1503878400 |
| 117402 | 630.0 | M | AV | NE | 630.0 | GHI000099606 | TB | 45-49 | Wlg | 0 | MidAge | AV | 6.445720 | 6.445720 | 1.0 | 2017 | 6 | 26 | 28 | 2 | 179 | False | False | False | False | False | False | 1498608000 |
| 117403 | 216.0 | M | AV | M | 216.0 | GHI000135471 | TB | 45-49 | O | 0 | MidAge | AV | 5.375278 | 5.375278 | 1.0 | 2019 | 11 | 47 | 22 | 4 | 326 | False | False | False | False | False | False | 1574380800 |
| 117404 | 340.0 | F | AV | NE | 340.0 | GHI001140992 | TB | 65+ | C | 0 | Old | AV | 5.828946 | 5.828946 | 1.0 | 2017 | 5 | 18 | 5 | 4 | 125 | False | False | False | False | False | False | 1493942400 |
Most frequent
| Applied | Gender | Payment_Method | Location | Received | Id | Reason | Age | Area | True_False | AgeGroup | Payment_Type | logapplied | logreceived | Ratio | Year | Month | Week | Day | Dayofweek | Dayofyear | Is_month_end | Is_month_start | Is_quarter_end | Is_quarter_start | Is_year_end | Is_year_start | Elapsed | count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 120 | 208.0 | F | AV | M | 208.0 | GHI000112669 | TB | 20-24 | O | 0 | Adult | AV | 5.337538 | 5.337538 | 1.0 | 2017 | 3 | 12 | 20 | 0 | 79 | False | False | False | False | False | False | 1489968000 | 6 |
| 125 | 208.0 | F | AV | M | 208.0 | GHI000112669 | TB | 25-29 | O | 0 | Adult | AV | 5.337538 | 5.337538 | 1.0 | 2016 | 12 | 51 | 21 | 2 | 356 | False | False | False | False | False | False | 1482278400 | 5 |
| 157 | 208.0 | F | AV | M | 208.0 | GHI000112669 | TB | 30-34 | O | 0 | MidAge | AV | 5.337538 | 5.337538 | 1.0 | 2016 | 12 | 51 | 21 | 2 | 356 | False | False | False | False | False | False | 1482278400 | 5 |
| 167 | 208.0 | F | AV | M | 208.0 | GHI000112669 | TB | 30-34 | O | 0 | MidAge | AV | 5.337538 | 5.337538 | 1.0 | 2017 | 3 | 10 | 10 | 4 | 69 | False | False | False | False | False | False | 1489104000 | 5 |
| 171 | 208.0 | F | AV | M | 208.0 | GHI000112669 | TB | 30-34 | O | 0 | MidAge | AV | 5.337538 | 5.337538 | 1.0 | 2017 | 3 | 12 | 22 | 2 | 81 | False | False | False | False | False | False | 1490140800 | 5 |
| 327 | 210.0 | F | AV | M | 210.0 | GHI000112669 | TB | 25-29 | O | 0 | Adult | AV | 5.347108 | 5.347108 | 1.0 | 2017 | 12 | 51 | 22 | 4 | 356 | False | False | False | False | False | False | 1513900800 | 5 |
| 100 | 208.0 | F | AV | M | 208.0 | GHI000112669 | TB | 20-24 | O | 0 | Adult | AV | 5.337538 | 5.337538 | 1.0 | 2016 | 12 | 49 | 7 | 2 | 342 | False | False | False | False | False | False | 1481068800 | 4 |
| 109 | 208.0 | F | AV | M | 208.0 | GHI000112669 | TB | 20-24 | O | 0 | Adult | AV | 5.337538 | 5.337538 | 1.0 | 2017 | 1 | 4 | 24 | 1 | 24 | False | False | False | False | False | False | 1485216000 | 4 |
| 123 | 208.0 | F | AV | M | 208.0 | GHI000112669 | TB | 25-29 | O | 0 | Adult | AV | 5.337538 | 5.337538 | 1.0 | 2016 | 12 | 49 | 7 | 2 | 342 | False | False | False | False | False | False | 1481068800 | 4 |
| 135 | 208.0 | F | AV | M | 208.0 | GHI000112669 | TB | 25-29 | O | 0 | Adult | AV | 5.337538 | 5.337538 | 1.0 | 2017 | 2 | 6 | 10 | 4 | 41 | False | False | False | False | False | False | 1486684800 | 4 |